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Abstract;  Mew  upper  bounds  are  given  for  the  expected  value  of  a  convex 
function.  The  bounds  employ  subgradient  information  and  the  conjugate 

OA*  t-r,  t-  1 

function.  We  derive  the  bounds  and  compare«tfce«i  with  previous  bounds  with 
different  information  requirements. 
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1  INTRODUCTION 


Evaluating  the  expectation  of  a  convex  function  is  a  central  require¬ 
ment  in  utility  theory  (see,  for  example,  Fishburn  (19701)  and  stochastic 
programming  (see,  for  example,  Dempster  [1980]).  In  genera),  these  problems 
involve  optimizing  the  expectation  of  some  function  of  certain  random 
variables  and  decision  parameters.  We  assume  that  this  function  is  convex 
and  that  certain  properties  of  the  convex  function  and  the  underlying 
probability  measure  are  known.  We  show  that  new  upper  bounds  on  this 
expectation  are  available  when  the  information  includes  subgradient  and 
conjugate  function  information.  This  result  is  especially  useful  when  the 
original  integral  is  not  easily  computable  as  we  show  below. 

The  most  basic  bound  on  expectations  of  convex  functions  is  Jensen  s 
lower  bound  (Jensen  (1906]  which  requires  knowing  only  the  finite  means  of 
the  random  variables.  Madansky  [1959],  following  Edmundson  (1956],  gave  an 
upper  bound  based  on  the  theory  of  moment  spaces.  This  bound  again  requires 
finite  mean  value  information  and  a  bounded  n-dimensional  rectangular 
domain  of  stochastically  independent  random  variables.  Ben-Tal  and 
Hochman  (1972)  extended  and  refined  the  Edmundson-hadansky  bound  by 
including  information  of  the  expected  value  of  the  absolute  difference 
between  the  random  variable  and  its  mean.Gassmann  and  Ziemba  (1986] 
provide  a  weaker  bound  that  does  not  require  independence  (as  in  Dupacova 
1 1974])  or  n-dimensional  bounded  regions.  Frauerdorfer  (1986]  provides  the 
extension  of  the  Edmundson-Madansky  bound  with  dependencies  and  know¬ 
ledge  of  the  joint  expectations  of  the  random  variables. 


The  general  process  of  obtaining  these  bounds  as  solutions  of 
moment  problems  is  described  in  Birge  and  Wets  (1986],  The  solution  of 
linear  approximations  is  given  in  Birge  and  Wets  (19871.  Explicit  solution 
procedures  also  appear  in  Ermoliev,  Gaivoranski  and  Nedeva  (19871  and  Cipra 
i  "'*'61.  They  are  also  used  in  Oula  (19861  to  provide  bounds  for  the  expecta¬ 
tion  of  convex  functions  with  additional  properties  given  first  and  second 
moment  information. 

Our  results  differ  from  the  above  results  in  our  not  requiring  explicit 
moment  information  but  instead  information  regarding  the  conjugate 
function  and  the  expectation  of  the  gradient  and  the  inner  product  of  the 
gradient  and  the  random  vector.  We  first  give  a  one-dimenstional  result  in 
Section  2.  Section  3  provides  an  extension  in  n-dimensions.  Section  4 
compares  our  bound  with  previous  bounds  in  R  and  Section  5  provides  the 
comparison  in  IRn.  Section  6  describes  possible  refinements,  and  Section  7 
gives  conclusions. 

2.  AN  UPPER  BOUND  IN  R 

Let  (Q,  2,  F)  be  a  probability  measure  space  and  let  X:  (a,b)  be 

a  random  variable,  where  -«><.a<b(*<»,  with  distribution  F. 

Let  <$>:  (a,bl  -♦  IR  be  a  convex  differentiable  function  LUe  denote 
expectation  with  respect  to  F  by  E  and  throughout  this  section  we  assume 
that  E$(X),  E$  (XI  and  EX<$>  (X)  exist  and  are  finite 


Theorem  2.1  Let  4>:  (a,b)-»ffl  be  a  convex  differentiable  increasing  function 

i 

and  assume  that  E<|>  (X)>o.  Then, 


EWU*S!=D 


12.1) 


Proof:  For  any  convex  differentiable  function  $  on  (a,b),  the  following 
inequality  holds: 

4>(s)  -  4>(t)  >,  (s-t)  <J>‘  (t)  for  all  s,t  €  (a,b)  (2.2) 


Set  t=X  (clearly  in  (a,b)l ,  s  -  • 

y  E4>’(X) 

Since  is  increasing  and  E<t»'(X)>o,  s€(a,b).  Substituting  s,t  in  (2  2) 
cue  obtain 


E<t>‘(X)  E$'(X) 


(2.3) 


Take  expectation  on  both  sides  of  (2.3)  cuith  respect  to  F  and 
observe  that  in  the  right  hand  side  of  (2.3)  (cuith  E$'(X)>o): 

E$'(X)  -  EX4>'(X)  -  0. 

E4>'(X) 


The  result  follocus. 


□ 


Remarks  21 


(1)  If  <$>  is  strictly  increasing  ana  concave  then  inequality  (2.1)  is  reversed. 

(2)  If  <t>  is  strictly  decreasing,  then  assuming  E't>’(X)<o,  inequality  (2.1)  is 
still  valid. 

(3)  The  differentiablity  assumption  on  <t>  can  be  relaxed.  For  if  $  is 
convex  its  left  and  right  derivatives  4>J(x)  and  <t>'+  (x)  exist,  and  are 

finite  and  non-decreasing.  Moreover  the  subdifferential  of  <t>  is 
given  by,  34>(x)  =  (zeIR  :  <t>'(x)^z^4>'(x)J  (see  e  g  Rockafellar  (l970j,  pp. 
228-229) 

Theorem  2.1  remains  valid  if  cue  substitute  any 
zed  4>(x)  =  (<t>’_  (x),  4>‘+  (x))  for  4>‘(x). 

□ 

Jensen's  inequality  for  a  convex  function  <$>  provides  us  cuith  a  locuer 
bound  for  E<fr(X) : 

4>(E(X))  *  E<t>(X)  (2.4) 

Combining  inequality  (2.4)  cuith  Theorem  2.1  allow  us  to  derive  a  re- 
arranoment  type  inequality 

Corollary  2.1  Under  the  assumptions  of  Theorem  21,  cue  have 

EX$'(X)  >,  £(X)  E4>'(X)  (2.5) 

Proof  Simply  follows  from  (21),  combined  with  (2.4),  and  using  that  4>  is 
increasing  and  E$’(X)  >o.  □ 


More  generally,  let  g:  (a,b)-»IR  be  a  given  increasing  function. 
Since  is  convex,  4>’  is  increasing  and  so  fltj^’lglt))  is  increasing.  Then 
inequality  12.5)  implies 

Eg(X)ffX)  >  EglXI  EflX)  (2.6) 

Inequalities  (2.5)  or  (2.6)  can  be  used  to  obtain  bounds  on  system  reliabil¬ 
ity.  For  general  results  on  rearrangement  inequalities  and  applications 
see  Karlin  and  Rinotl  [l98lj  and  the  references  therein. 

3  AN  UPPER  BOUND  IN  Bn 

In  this  section  cue  present  a  natural  extension  in  IRn  of  the  upper 
bound  derived  in  Theorem  2.1. 

Let  X  be  a  random  vector  on  the  probability  space  (Q.  2,  F)  with 
distr  ution  function  F  and  let  S  C  fin  be  the  support  of  X. 

Assume  that  S  is  convex  and  let  <f>:  S— ♦IR  be  a  convex  differen¬ 
tiable  function.  The  gradient  of  4>  at  x  is  denoted  by  V4>(x).  The  conju¬ 
gate  convex  function  of  <\<  i*  rlofinprl  hij 


4>*(y)=sup  (x^-^lxj) 
x 

In  the  sequel  cue  assume  that  E<t>(X),  EXTV<|>(X)  and  EV4>(X)  exist 
and  are  finite 

Theorem  3.1  E$(X)  *  EXTV$(X)  -  V(EV$(X))  (31) 


Proof  Since  $.  S~*)R  is  convex  and  differentiable,  the  gradient  inequality 
holds,  i  e ,  $(<*)  -  <t>(p)  )  (<*-|3)TV4>(p)  for  all  &,  |3  e  S. 


(3  2) 


Setting  p  =  X  in  (3  2)  and  taking  expectation  with  respect  to  F  in  inequality 
(3  2)  implies 

E*(X)  V'EXT7<MX)  *  'Met)  -  ctTE7<MX)  for  all  a  <E  S. 

Hence, 

E<|>(X)  i  EXT74>(X)  ♦  inf  (<*>(*) "  E7<t>(HI).  (3  3) 

<X 

Mote  that: 

inf  [Mac]  -  a?  E74>(X))  =  -  sup  (cJ  E7$(X)  -  $[<*}} «  -  (E7<fr(X)). 

ct  ct 

Inequality  (3.1)  follows  immediately  from  (3.3).  □ 

Remark  (5.1)  fin  alternative  proof  of  Theorem  3.1  may  be  derived  using 
the  following  useful  relation:  (see  Rockafellar  (l9?0j,  p.  257) 

4>*  (7<J>(z) )  =  zt7<$>(z)  -  4>(z)  13.4) 

Setting  z=X  and  taking  expectation  in  (3.4)  we  obtain 

Ecfr(X)  =  EXT7<MX)  -  E$*  (74»(X)  (5  5) 

But  since  <$>  is  convex  so  is  4>*  and  hence  by  Jensen's  inequality 

<t>*(E74>(X))  $  E<t>*(7<MX))  (3  6) 

Then  (3.5)  combined  with  (3.6)  implied  (3.1).  This  proof  will  be  useful  to 
refine  the  upper  bound;  see  Section  6.  □ 

Remarks  3  2 

(1)  If  <t>  is  concave  Inequality  3.1  is  reversed. 

(2)  fls  mentioned  in  Remarks  2.1  (3),  Theorem  3.1  remains 

valid  if  instead  of  74>(.)  we  substitute  any  ze  a  4>.  □ 


The  one  dimensional  version  of  Theorem  3.1  (n=l.  Sda.bJJ  provides 
us  ujith  the  upper  bound  E<t>(X)  ^  EX<t>'(X)  -  4>*  (E<t>’(Xl):  *  C  The  next  result 
shows  that  the  bound  C  is  hetiw  1 1 the  upper  bound  0  derived  in 
Theorem  2.1. 

Theorem  5  2  Under  assumptions  of  Theorem  2.1,  we  have 

E4>(X)  $  C  *  0  (3.7) 

Proof.  For  any  convex  functio  4>  and  any  cy<p  dnrn  f ,  p  e  dom  <K  the 
inequality 

4>(a)  ♦  $*(p)  >,  ap  (3  a) 

holds  Suhstitutinn  in  (3  Ri  =  FXf  (X)  ^  f-(j  _  ^nrfl  ^  flr„  f 

E<t>‘(;.» 

p  =  E<t>'(X)  e  range  <t>‘  c  dom  $*,  the  result  (3  7)  follows  □ 

4.  COMPARISONS  OF  BOUNDS  IN  R 

Throughout  this  section  <p.  is  a  given  convex  differentiable 
convex  function  and  X  is  random  variable  with  distribution  F  and  density  f 
Luith  support  (a,b).  (JUe  compare  the  upper  bounds 

C  =  EX$’(X)  -  4»*(E4>,(X))  and  D  with  the  following 

E4>’(X) 

well  known  upper  bounds: 

Edmundson-Madansku  (19561 

Efvj.  _  (b-x)  <t>  fa)  ♦  (x-b)  4>  (b)  (4.1) 

b-a 


where  x:  =  E(X)  <  oo,  (a,b]  is  a  finite  interval. 
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Bert-Tal-Hothman  1)9721 


BH  -  d  (  M  ♦  ^ 

2  x-a  tr* 


<t>(x)  1 1  -  ~ 


fb-a) 

fb-xl  (x-a) 


] 


where  x  =  EIXI  *  •*>  (a,b)  is  a  finite  interval,  and 


(4.2) 


b  x 

d  *  E  iX  -  x|  =  z{  Ix-x)  dF(x)  -z\  (x-x)  dF(x)  is  the 
2  a 

expected  absolute  deviation  about  the  mean.  Using  this 
additional  information  on  the  random  variable  X.  it  is  shown 
that  BH  gets  closer  to  E$(X).  =  $  than  EM  i.e,  i  BH  $  EM 


Remark  41  The  upper  bound  BH  can  be  obtained  for  an  infinite  interval 
(a,b),  -r-*Ma<b<  +  <»,  under  additionals  assumptions  on 
4>,  see  Ben-Tal  and  Hochman  (1972).  □ 

Example  4.1  <$>(X)  *  x2,  x>o,  X~U  (0,1)  Then  x  =  1/2,  d»1/4,  <$>  =  1/3. 

Using  (4.1)  and  (4  2)  we  obtain  EM  =  1/2  and  BH  =  3/8. 

Here  4>*  (y)  =  1/4  y2  and  then  we  compute  C  =  5/12, 

0  =  4/9  Jensen's  inequality  yeilds  the  lower  bound 
J  =  1/4  and  J<^$BH$C$D$  EM.  □ 

The  next  example  illustrates  a  situation  where  EX<$>’(X)  and  E<$>'(X) 
(and  hence  C  and  0)  are  easy  to  compute  while  E<$>(X)  requires  the 
evaluation  of  a  complicated  integral.  Moreover,  in  this  example,  the 
upper  bounds  BH  and  EM  are  shown  to  be  trivial,  i.e.,  BH=EM2  +  °° 


10 


Example  4  2  Let  $(x)  *  -Ln  (1-x2)  -l$x.<J,  and  let  X  be  a  random  variable 
ujith  density  f(x)=3/2(l-x2)  for  o$x$1 

i 

Then,$  =  -  — J  1 1-x2)  Ln  (1-x2)  dx  14  3) 

2o 

To  compute  the  integral  (4.3)  u/e  use  the  following  known 
integrals  (see  e  g.  Gradshteyn  and  Ryzhik  (1980)  pp.  557-558): 

J*0  xA'!  Ln  (1+x)  dx  =  1/A  (Ln2-f3(A+l))  (A>-1) 

J\>  xA_1  Ln  (1-x)  dx  =  1/A  (cp(D  -  $  (A  +  1))  (A>-1) 

where  $(1)  =  -y ,  y  =  0.5??215...Euler's  constant  and  the  functions 
$,  p  satisfy 

$(x+1)  =  vp(x) ,  $(— }-$(-)  =  2p(x),  x>0  (see  e  g.  (I2)p  945). 
x  2  2 

After  some  algebraic  manipulations  one  obtains  $  s  5/3  -  2Ln2 
~  0.280372. 

The  conjugate  function  is  given  by 

$*  (y)  =  Vi+y^-1  ♦  Ln  (1  -  (£±Lii)2)  . 

y 

EX$'(X)  =  3jJ0  x2dx  =  1,  E$'(X)  =  xdx  =  3/2. 

Hence,  $'  (E$'(X))  *  $*(3/2)  a  0.4653  and  thus  O  0.53468  and 
0  =  $  (2/3)  3  9/5  =*  0.58778. 

Jensen's  inequality  yields  the  lower  bound  J  =  $(x)  =  $(3/8)  ^ 
0.1515.  fl  rough  estimate  of^  could  be  obtained  by  averaging  the  upper 
bound  C  and  the  lower  bound  J  to  give  0.34309. 

Finally  we  note  that  since  here  $(1)  =  ♦<»,  EM  =  BH  s  +<».  (It  can 
be  verified  that  d  *0.)  □ 


In  Theorem  3  2  we  show  that  C  $  D,  and  Ben-Tal  and  Hochman 
prove  that  BH  $  EM  Examples  4.1  and  4.2  illustrate  situations  where 
C  ^  0  ^  EM.  In  fact  we  tested  many  other  examples  and  found  C  $  EM 
This  inequality  is  not,  however,  always  valid  as  illustrated  in  our  next 
example. 

Example  4  3  Let  <fr(X)  =  1  -  (1  -  (x  -  1)2)1/2,  0$X*2  and 

fix)  =  — (1  -  (1  -  (x  -  nY2,  0<X« 2. 

4  -  n 

Then,"x  =  I,  EM  *  1,  BH  *  0.7766,  J  =  <fr(x)  =  0  and  $  =  E*(X)  *  0.447  The 
conjugate  <p*  is  <t>*  (y)  ■  y(1  +  -ik_)  ♦  ..1  ,  -  1,  (4.3) 

f  2  12 

V  i+y  vi+y 

EX4>‘(X)  =  “  2.1065,  and  E4>'(X)  *  0. 

3(4 -tt) 

Hence  4>*  (E4>1X))  =  4>*[0)  =  0  and  then  C  ^  2.1065  >  EM  =  1.  □ 

Note  that  the  bound  0  is  not  computable  here  since  the 
assumption  of  Theorem  2.1  E$'{X)>0  is  violated.  The  example  not  only 
demonstrates  that  EM  is  better  than  C,  but  also  that  the  bound  C  may  be 
a  “bad"  upper  bound.  However,  we  show  in  Section  6  that  the  bound  C  can 
be  considerably  improved  to  be  even  sharper  than  BH;  see  Theorem  61 
and  Example  6.2. 

UJe  have  already  mentioned  in  the  introduction,  that  the  compu¬ 
tation  of  the  upper  bounds  EM  and  BH  requires  a  finite  mean  x.  This  is  not 
the  case  for  C.  Further  BH  requires  the  value  of  d  which  may  be  difficult 
to  compute.  In  the  last  example  of  this  section  we  consider  the  case 
when  x  and  d  fail  to  exist  and  therefore  the  upper  bounds  EM  and  BH  are 
not  available 


Example  4  4  Let  X  be  a  random  variable  with  density 

fix)  = - - — —  o  <  x  <  ♦  oo  and  let  <t>(x)  ■  -  2  -/x 

TT(1  ♦  X‘) 

f - 

Then  ^  ~  dx  =  -2  VI »  -2.8284,  and  EX4>‘(X)  =-72, 

77  1  ►  x 

EX<t>'(X)  =  -VI.  HenceC  =  -VI  -  <t>*  (-71)  =  -  2. 1213.  □ 

5  COMPARISONS  OF  BOUNDS  IN  Rn 

Gassmann  and  Ziemba  (1986)  extend  an  idea  of  Edmundson  and 
Madansky  (1956)  to  derive  an  upper  bound  on  the  expected  value  of  a 
convex  function  of  a  random  vector.  The  bound  is  given  as  the  solution 
of  the  following  linear  program:  (see  Gassmann,  Ziemba  (1986),Theorem  1) 

m  m  m  - 

GZ  =  max  (Z4>(Vi)  Aj:  2  A  s  =  1 , 2  Av.=  x,Aj^o)  (51) 

X  M  i=i  1  i  =  1  1  1 

where  <t>.  S-*fi  is  a  convex  function,  S  C  Rn  is  convex,  (vj, ... ,  Vm)  are  the 
extreme  points  of  a  bounded  convex  polyhedron  containing  S,  and 
x  =  (EX] , ... ,  EXn)T  is  the  finite  mean  of  the  random  vector  X.  We  compare 

the  bound  6Z  with  the  upper  bound  derived  in  Theorem  3.1: 

C  =  EXtV4»(x)-4>*(EV4>(x)). 

Example  5.1  (Taken  from  Gassman-Ziemba  (1986)  p.  42.) 

3^)^  jf  x^+x*  $  1 
0  otherwise 


13 


2  2 

Then  S  =  {(x,,x2)  :  x,  +  x2  ^  1}  and 


i  =  EeX1s-5- 
2tt 


t  y1 i-xf  | 

j  J  e*v(  -  Xj  -  Xj  d  x,  d  Xj  ■  —  =  1.0364 

Vk  e 


Using  (5.1)  the  best  upper  bound  derived  in  111]  is  shown  to  be 
GZ  *  1.54308.  llle  now  computp  the  hnunrt  r  The  conjugate  of 

x2)  -  ex>  is  (y^  y2)  =  yj  Lnyj  -yj  and  EV<J>(X)  =  (Eexi,  0]T  = 


■7  T  7  V 

(-  ,  o)  (using  5  2),  then  <t>*  (tv<$>(xjj  =  <K  (-,  Uj  =  (-  (Ln  3-2)  «  -  u.yy48. 
e  e  e 

Mow, 

EXVm  -  «X,  e'l  ■  jl,  /  x,  e'  j  *?,->$  dx,  dx,  - 

3p2_7i 

— — '  =*  0.2146  and  then  C.  =*  1.20948  <  GZ  *  1.54308. 
ze 

Mote  that  Jensen's  inequality  yields  the  lower  bound  J  ■  $  (x)  =  $  (0,0)  =  1 
and  thus  an  estimate  of  4>  could  be  obtained  by 


J  ♦  C 


=*  1.10474  giving  an  error  of  about  7%. 


For  a  random  vector  X  =  (Xj, ...  Xn)T  with  independent  components  X^ 

the  Edmundson-Madansky  and  Ben-Tal-Hochman  upper  Bounds  are 
available  where  S  is  an  n-dimensional  rectangle  of  the  form 

S  =  Ti  [a.  ,  b.j  .  They  are  given  by  the  following  expressions  (1): 


Then,  4*  =  E4>  (x  xj  =  -  Using  (S3)  and  (5.4)  we  obtain 
respectively  £M*-i-(4>{0,0)  ♦  4*  (0,1)  ♦  4-0,0)  ♦  441,1))  =  1. 

and  BH  =  —  (4>(0.0)  ♦  440.1)  ♦  4>(1,0)  ♦  *  (1.1))  ♦  -  (4>  (0,^)  ♦  4>(io)* 
16  3  2  2 


4>(1,  -1  ♦  4>(-,  1))  ♦  — ) »—  From  (51)  uue  have: 

2  2  4  2  2  3 


GZ-rnaxl-^-^ZVA,*  V  V  V'-V  V|- 

A2  +  A4  =  2^  maX  **  "  ^  °  *  A2  *  9*  ^  1 
Now  iue  compute  the  conjugate  function  at  the  point  x*  =  (X|*,  x2*\J 
♦*lx*  x!)«±  (x**e)TFi(x*+e),  f)  =  |  f1  ~h.  e  *I1,1)T. 

12  2  4  -1  t> 

and  E  V  $(xJ#  x2)=  (2,0)T,  ExTV<Hx)  =  Then  4»*(E  V4>(x))  = 

4>*  (2,0) 3  1.  This  yields  the  upper  bound  O  and  we  have 
$  <  BH  <  C  <  EM  <  GZ. 


Finally  Jensen's  inequality  yields  the  lower  bound  J  =  <Mx)  = 

♦(— ,  — )  *  0.  (Note  incidently  that J  4  c-  =  4  =  —  !) 

2  2  2  A 


The  bound  C  is  in  fact  sharp  lie ,  C  =  4>)  for  many  examples 
For  example,  consider  a  piecewise  linear  function  of  the 
form: 


where  t  is  fixed  This  is  the  form  of  the  recourse  function  in 
stochastic  linear  programming  Isee  UUets  11966)1  In  this  case 


4>*(\ 


k 

1  =  inf  X 
1=1 


A  4>. 


(v.) 

i 


I  A  H.  I 


A.V  =  V 

i  i 


A  01 


15  6i 


from  Rockafellar  11970),  Theorem  16  5,  inhere 


■J>  I v'l  *  sup  (vTx  -  (x-tJTTr'l  '5  71 

'  rr't  if  v1  =  tt’ 

.  - 

,  +  oo  otherwise 
From  15  71, 


rvt.  if  v€  co  (n*  1 1  - 1, k) 


= 


oo  otherwise 


158) 


mins*  wji  wwviwi^wnjinw'^AwwwuwwTiwwiinwiiAwww/WAwwwnw«nww)w\wi. 


Let  4>(x)  =  (x-t)Trr(x)  (ttIx)  not  necessarily  unique)  and  let 
\7<Hx)  =  tt(x),  then  El^fx)  =  El(x-t)1  jrlx)) 

=  E(xT7«ft<))  -  E  (tTTT(x)) 


=  Elx  7<$>(x))  -  $>*(E(7$(X))). 


mhere  me  note  that  El7$(x))  e  co  (rr1 1  i =1, ....  k)  Hence,  $=C  (5-9) 


6.  REFINING  THE  UPPER  BOUNDS  IN  R  AND  Rn 


The  upper  bound  C  derived  in  Theorem  3.1  can  be  naturally  sharpened  in 
the  one  dimensional  case,  mhen  X  is  a  continuous  random  variable  uuith 
density  function  fix). 


Theorem  6.1 


Let  <t>  R  -»R  be  a  convex  differentialbe  function  and  X  a  random 


variable  rnith  support  (a,b)  and  density  fix).  Then, 


u  ^ 

E't>(X)  $1  (K  la.b)  -  J  x$(x)  fix)  dx  -  <t>*  (E4>(X):=  C  . 


mhere  K(a,b):  =  b$(b)f(b)  -  a4>la)f(a) 


Moreover  C  is  sharper  than  C,  i.e.,  C  $  c 


Proof: 


From  Theorem  31  in  R,  E4>(x)  *  C  ■  EX  $'(X)  -  $*(E$'(X)) 


w 

Integrating  by  part  EX  <t>'(X)  ■  \  x  <t»*(x)  f  (x)  dx  me  obtain  from  (6.4): 


E«J>(x)  $  (x$(x)  f  ix))^  -  <$>(x)  fix)  dx  -  x^lxJf’lxJdx-^lE^lX)) 


<  o 

I  • J 


=  K  (a,b)  -  E<fr(X)  -  J  x^ix)  fix)  dx  -  ♦*(E^1X)) 


and  then  16.1)  follows.  To  show  that  C  .<  c,  observe  that 


C  =  1  (EX4>‘(X)  +  £4>(X)  -  4>*(E<t>‘(X))  =  -MC  +  E<*>  (X))  and  that  E$(X)  *  C. 
implying  (6  3)  □ 


Example  6.1 

UJe  reconsider  Example  4.3  given  in  Section  4. 

2 

K(a,b)  ■  K(0,2)  =  and  x$tx)f'(x)  dx  *  5^-.  This  yields  the 
4-tr  o  314-jr) 

upper  bound  C  »  «  12766  <  C  *  2.1065.  Note  that  we 

6(4— TT) 

.“w 

still  have  EM=1  <  C;  but  see  Example  6.2. 


It  is  interesting  to  note  that  when  X  ~  U  (a,b)  then  the  upper 
bound  C  is  better  than  EM.  For  if  X  ~  U  (a,b)  then  EM  -  ~ 

and  C  =  1  (b  £  ♦  $  (a)  -  <J>*  O  where  £  =  e  (4>'(a),  <fr'(b)l 

2  b-a 

(since  $  is  convex). 

Hence,  from  the  inequality  $(b)  +  $*(£) )  b  £,  it  follows 
immediately  that  C  $  EM. 


Following  Remark  3.1  of  Section  3,  we  can  further  sharpen  the  upper 
bound  C  by  using  a  lower  bound  established  by  Ben-Tal  and  Hochman 
0972)  which  is  better  than  Jensen's  lower  bound.  Let  g:  be  a  convex 

function,  and  Y  a  random  variable  with  support  (a,b)  then: 


EglYI  >  (3g(y  .  A)  ,  (i-p)  g  iy  -  ^  I:  -  Lg  lp.dl 


f6.6) 


and  L  (p.d)  >,  qlE(VJ).  where  y  =  E(V)  <  oo,  d  =  E  I V  -  Q  |  and  (3:  =  Prl'£y) 
From  (3  51  we  have  E$(X}  =  EX4>‘(X)  -  E$WX]]  (6  7) 


Applying  (6.6)  with  g:  «  and  V:  =  <p‘  (K)  one  obtain:  line  denote 
$’:-EW)) 


E<t>*(*'tMM  >  |34>*  (♦'»  A)  ♦  (t-p)  >  (**  (EftX)ll  (£■«> 

and  hence  from  (6.7)  this  implies 

E$(X)  *  EX  *’(X)  -L  (d,p) :  *CR*  C.  ^ '  °J 

ujhere  p  =  Pr  (  $'(X)  >y  $ )  and  d  =  E  |  <t>'(X)  -  4>‘  l 
Furthermore,  if  X  is  a  random  variable  with  density  fix),  even  sharper 
bounds  are  possible.  Following  the  proof  of  Theorem  6.1  together  with 
(6.9)  we  obtain 

b 

CR  :=|(K{a£)-  f  x<Mx)f  (x)dx  - 1  (d,p))  (6 10) 

i  2  :  4> 

o 

Clearly,  CR  $  and  dj*  $  Cy  fl  natural  question  is  whether 
orC^  is  better  than  BH?  It  seems  difficult  (if  not  impossible) 


to  prove  such  a  result  in  general.  However  in  our  worst 
examples  (4.3  and  6.1)  this  appears  to  be  true  as  demonstrated 
below. 


i  jtv  wv  wv  wu  w\  w  rv  *r j  wj  ww  vu 


>-  jy  ,»** ,««»  ,*.*■ 


'J»W.rW»'.p»V»’.  WJ  r-J 


WIWCT. 
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Example  6.2 

Life  have  already  computed  in  Example  4  3^nd  61^-0  44? 

BH  *  0  7766,  EX*‘(Xj  a  2.107,  K  10,2)  »  |x<t»lxjf(x)dx=^± 

Arn  o  314-rr) 

For  the  random  variable  <t>'(X)  me  compute  p  =  —  and  d  =— — 

2  4-tt 

Then  using  the  conjugate  <J>*  given  in  (4.3)  me  obtain 
L  ,  (d,  p)  =  1  (4>*(d)  ♦  <t>*  (-d)J  *  1.5354.  Using  (6.9)  and  (6.10) 

▼  C 

It  folio uus  that  cfj  a  0  5711  and  (?,=  0.5088  Thus, 

$  <  C?<  C*)  <  6H  <  EM  <  C  <  C.  □ 


UUe  norn  turn  to  the  problem  of  refining  the  bound  C  in  IRn.  For  a  random 

vector  X  =  (xj, . ,Xn)  m.th  independent  components  X,  the  BerrTal 

Hochman  lomer  bound  is  available  uuhen  S  is  an  n  dimensional  rectangle 


n 

of  the  form  S  =  n  (a.,  h),  and  is  given  by: 
M  1  1 

Vfmfyhto;. . • 

2  k-1  1  1  « 


1611) 


where  h:  fl"  -»IR  is  a  convex  function,  <$k  is  1  or  2,  is  the 
set  of  2°  n-dimensional  vectors  uuhose  components  are  all 


-  .  4c 


k  - 


1*s  or/and  2*s,  p*  =  p*  =  1-a  ,  a*  =  x  ♦  --  ,  a*  =  x- ■ - * — 

1  k  1  ^  2^  2  >.  2(1-pj.) 


and  d  ,  p. ,  x.  denote  the  corresponding  parameters  of  X. 


To  apply  (6.11)  as  in  (6  7)  and  (6.8)  requires  shoming  that  the  random 
vector 


,  ■  M  »  I 

.•v-v* 


r 


iJ  I.*  L>  M 


3 


7  •{■  lx  )  has  independent  components,  and  computing 

the  corresponding  parameters  p  and  dk  This  fairly 

complicated  task  can  be  avoided  by  characterizing  the 
class  of  functions  for  which  the  function  4»(-)-4>*(V4>(-)) 
is  convex.  Moreover  this  allows  us  to  express  the  new 
upper  bound  explicitly  in  terms  of  the  problem’s  data 
(without  requiring  knowledge  of  4>*jj  see  Theorem  6.2. 

The  following  result  gives  a  sufficient  condition  for 
to  be  convex  for  a  large  class  of  functions  arising  in 
applications. 


Lemma  6.1 


Let  <$> :  Self  -♦  (R  be  a  given  twice  continuously  differentiable 
function.  If  g:  IB"  -» IRn,  g(z] :  =  7  <$>(z)  is  convex,  then 


•!>(zj .  *  <t>*{7<t>(z))  is  convex. 


Proof 


<\>  is  a  convex  function  if  and  only  if  it  satisfies  the  gradient 
inequality,  i.e.,  for  any  x  and  y  in  S 
•V(x)  -  4»(yl  >,  (x-y)T  4>lyi 


(6.12) 


By  definition.  (y)  =  <t>*  (V  <t>(y))  and  thus  V  <|4y)  =  V2  4>(y)V4>*(V4>(y)) 


where  T2  <My)  denotes  the  Hessian  of  i-  Using  V  4>+  =  (V  4>f\ 
•t  follows  that  7«My)  =  V  2  #y)  y.  Inequality  16  12)  then 


becomes 

IV  4>txll- (7  4>lyJj  >  (x-yJT7  2  4>(yjy  (613) 


Now,  since  4>*  is  convex,  applying  the  gradient  inequality  to 
<t>*  and  using  7  <$>*  =  (  74>)”\  we  obtain 


♦*  (V  $(x))-  <f>*  (7  4>(y))  )  (7  4>(x)  -  74>(y))Ty 

Thus  to  prove  (6 13)  it  is  sufficient  to  show 

(74>(X)-74>(y))T  y^  (x-y)V  4>(y)  y  16.14) 

Since  g(x):  =  7$(x)  is  assumed  convex,  we  have 

7<$>(x)  -  7<|>(y)  \  (x-y)  72  <My).  (6  15) 

Multiplying  (6.15)  by  y^o  (recall  that  ScR11^ yields  (6  14) . 


□ 

LUe  can  now  derive  a  refined  upper  bound  for  a  random  vector  X  with 
independent  components  Xj.  UJe  make  the  following  assumptions: 


(II  $  =  ft  (a,  b)  c 

i=1  1  1  + 


(II)  7$M  is  convex 


Theorem  6  ? 

Suppose  (I)  and  (II)  hold.  Then, 
E*(X)  SEXtV*(XI  -  .  CR  where 


a{  '"W 


f  t 

1 


(6.16) 


f  f 


(6  17) 


Proof 

From  (6.7)  (in  ff)nl  we  have 

E44X]  =  EHT7*IX)-E<HWX)J 

Under  Assumptions  (I)  and  III),  Lemma  6  I  is  applicable  and  thus 
E^IX)  =  E<fr*(V<t>(Xl)  as  defined  in  (611)  Moreover, 

using  'p(zj  =  zT7'f>(z!  -  ‘fclzj,  the  expression  (6  !?J  follows. 
Example  6  5 

UUe  reconsider  Example  5.2  where  we  already  computed 

$  =  0.25,  EM  =  0.75,  C  =  0  50,  BH  =  0.375,  GZ  =  1,  EXT74>(Xt  =  1.5  and 
d  =  d  =  —  x.  =  x  =  — .  The  assumotions  (I)  and  III)  are  clearlu 

1  2  4  1  2  2 

satisfied  and  thus  Theorem  6.2  is  applicable. 

We  compute  =  p2  =  ^  and  thus 

Lt  ‘  \  Mf ’I '  *  *(f '  4*  *  *$■  4>  *  +  (  ‘  '  A  ^ 

with  *  (z,,z2IT  V4.tz ,,  Z2I  -  ♦Iz,,  z2)  =  |  z*  <  |  z*  *  z,  z2 

Then  =  1 1875  which  yields  the  upper  bound  CR  =  0.3125  <BH=0  375 
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7.  CONCLUSIONS 


lUe  have  given  new  upper  bound?-  f or  th?  r*>:p ~ r t -jti —  -if - 

function  using  gradient  and  convex  conjugate  function  information.  LUe 
have  shown  that  these  bounds  and  their  extensions  can  be  better  than 
previous  bounds  in  several  examples,  (lie  also  demonstrated  how  our 
bounds  are  especially  useful  when  the  original  integral  is  complicated 
but  has  a  gradient  that  can  be  easily  integrated  or  when  the  information 
required  for  other  bounds  (e  g.,  moments)  is  not  available.  The  new 
bounds  are  then  applicable  in  a  variety  of  applications  with  these 
characteristics. 
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